Optimal Control Algorithm for Subway Train Operation by Proximal Policy Optimization

نویسندگان

چکیده

With the increasing scale of urban subway, total energy consumption subway has increased dramatically and poses a great challenge to comfort passengers punctuality train operation. In order ensure on-time operation passenger comfort, at same time reduce operation, this paper proposes Proximal Policy Optimization (PPO)-based optimization algorithm for optimal control Firstly, reinforcement learning architecture is constructed with position speed as state, objectives, constraint. The proposed model trained by PPO algorithm, reward scaling added training process accelerate improve efficiency algorithm. experimental results show that can effectively while ensuring

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-objective Optimization Improved GA Algorithm and Fuzzy PID Control of ATO System for Train Operation

In order to solve the problem that automatic train operation control system considering the single factor and control is not easy to be accurate, a multi-objective optimization (MO) based on improved genetic algorithm (GA) and fuzzy PID control method is proposed in this paper. Firstly, based on train operation characteristics, a multi-objective model of train operation process is established. ...

متن کامل

Holistic optimization of train traffic by integration of automatic train operation with centralized train management

Nowadays, railways are confronted with numerous pressing problems, including capacity optimization, energy conservation, cost reduction and improving customer satisfaction. While the traditional railway is a very safe means of transportation, it still cannot meet all these requirements. Hence there are high interests in two available systems to overcome these challenges: Traffic Management Syst...

متن کامل

Proximal Policy Optimization Algorithms

We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a “surrogate” objective function using stochastic gradient ascent. Whereas standard policy gradient methods perform one gradient update per data sample, we propose a novel objective function that enables multiple epochs of ...

متن کامل

Train Coordinated Optimization Operation with Regenerative Braking

Reducing the traction energy consumption plays an important role in railway energy saving. The traditional train optimization operation theories, taking the coasting as the means of energy saving, can’t fit complicated lines and is suitable for the single train only. Meanwhile, the highly complicated line will cause the risk of train collision. On the background of the “11th Five-Year” State Sc...

متن کامل

Optimal Operation of CHP Combined Heat Generation Systems Using the Crow Search Optimization Algorithm

Energy efficiency of power plants is less than 60% However, the efficiency of the CHP units can be up to 90 %.CHP units in addition to high efficiency, They reduce environmental pollutants by 13 to 18 percent. The purpose of this thesis is to use the simultaneous power and power generation plants to reach the optimal economic destination for Genco And to maximize economic profit And to minimize...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13137456